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METHODS AND APPARATUS FOR DATABASE 
SPACE CALCULATION AND ERROR DETECTION 
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Field of the Invention 

The present invention relates generally to improved techniques for more accurately 
analyzing the additional capacity of hierarchical databases, such as those utilizing the IMS™ 
management software from IBM. More particularly, the present invention provides tools to 
10 look at physical sequential data files, such as the OSAM files used in IMS databases, to 
determine how much file space an IMS OSAM dataset, for example, is using in multiple 
virtual storage (MVS) or on a mainframe computer to proactively determine if the database is 
running out of space, thereby allowing a user to take steps to prevent the problems incumbent 
when a database runs out of space. 



Background of the Invention 

By way of example, if an IMS OSAM database exceeds a predefined system 
maximum capacity, such as about 8.4 gigabytes (Gb) or 8.4 billion bytes, it will fail and stop 
functioning. At the time the present invention was made, there was not a readily available 
20 way to be proactive and prevent such failures from happening. Existing tools did not make 



the needed information available. In fact, it was determined that existing interactive system 
productive facility (ISPF) and interactive storage management facility (ISMF) tools could not 
be reliably used for space determination for datasets guaranteed space because the dataset 
control blocks (DSCBs) were not correctly updated, as discussed further below. Regardless 
25 of the predefined system capacity, most users will want to accurately know when system 
limits are being approached so that proactive steps can be taken. 

A product that came with BMC IMS utilities was located, but that product was 
particular to its own internal processes during execution of the BMC IMS utilities and built 
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its processes thereupon. Consequently, this product was not applicable to IMS OSAM 
systems and the like. 

Summary of the Invention 
5 In light of the above, there was a critical need for tools to address the problems 

presented by the specific context of IMS OSAM datasets in particular and large hierarchical 
databases utilizing very large physical sequential files in general. 

Among its several aspects, the present invention provides techniques for accurately 
monitoring the order in which a large hierarchical database is stored to multiple disks, 

10 adjusting the percentage volume full or the measure of disk fullness to reflect that when a 
second or subsequent disk is full or partially full then a first disk is or previously filled disks 
are full, and automatically reporting when a threshold volume is exceeded so that a user can 
respond as appropriate. In the specific context of an IMS OSAM dataset, by looking at a 
DCOLLECT listing, determining the data attributes of the dataset, such as whether it is 

15 guaranteed space, the volumes or disks that the dataset is on, and the order in which the 

dataset is allocated to the volumes, an actual file size is determined, as well as how close that 
size is to approaching the limit of approximately 8.4 Gb. Appropriate reports are then 
generated. 

These and other features, aspects, and advantages of the invention will be apparent to 
20 those skilled in the art from the following detailed description taken together with the 
accompanying drawings. 

Brief Description of the Drawings 

Fig. 1 illustrates a system in accordance with the present invention; 
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Figs. 2A-2S illustrate a method of database space calculation and error detection in 



accordance with the present invention; 



Fig. 3 is a flow chart for an exemplary SUBLISTC subroutine for use in conjunction 



with the method of Figs. 2A-2S; 
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Figs. 4A-4C are a flow chart illustrating an exemplary SUB009 subroutine for use in 




=D 10 method of Figs. 2A-2S; 

y* Figs. 7A and 7B illustrate an exemplary SUBL002 subroutine for use in conjunction 

W with the method of Figs. 2A-2S; and 

Fig. 8 is an illustrative report which may be generated by the system and methods of 
W the above Figs, to provide a notification to a user that a database is over a used space 

O 15 threshold. 

Detailed Description 

The present invention addresses methods and apparatus for analyzing IMS databases 
on a mainframe computer, and specifically looking at the OSAM files that are associated with 

20 those databases. As indicated above, if an IMS OSAM database exceeds a designed 

maximum capacity, such as approximately 8.4 Gb, it will fail and stop functioning. Before 
turning to the details of a presently preferred IMS OSAM free space monitor in accordance 
with the present invention, the following definitions are provided: 
IMS - A hierarchical database management software product from IBM. 

25 OSAM- A physical sequential file that is used specifically by an IMS database. 
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SMS - System managed storage software from IBM that manages the placement of files in 
physical volumes of a mainframe computer. 

IDCAMS - A utility from IBM that provides catalog and volume information as well as basic 
information about physical files. 
5 DCOLLECT- A feature of IDCAMS that will return a file containing volume information 
and attributes about the files. 

GTS - Guaranteed space is a feature of SMS that allows a file to be allocated specifically on 
specific disk storage volumes. 

ACS- The code that SMS uses to determine where a file is to physically be allocated. 
10 LISTCAT- A feature of IDCAMS that will return information about a specific file of catalog. 
IEHLIST- A utility that will return specific data about a particular volume. The data is 
similar to a DCOLLECT, but in a different format. 

Data Class - An attribute of SMS that will assign certain characteristics to certain files. This 
attribute is used to group similar types of files together, in terms of their allocations. For 
15 example, all files that are defined as being guaranteed space will have a data class ending in 
"GTS" according to the presently preferred embodiment of the present invention. 



Turning to Fig. 1, this figure shows an IMS OSAM free space monitor system 100 in 
'accordance with the present invention. System 100 includes a source of online transaction 
20 data 1 10, a mainframe computer 120, three disk storage memories, volumes, or disks 1, 2, 
and 3 130, 140, and 150, IMS OSAM collection process software 160 which includes 
standard IMS OSAM software modified to include the processes and subroutines taught in 
the present invention and discussed in detail below In connection with Figs. 2A-2S, 3, 4A-4C, 
5A-5C, 6, 7A and 7B, and an exception report generatoKl70, such as a printer or display, for 
25 providing an exception report to a user, such as exemplary email report 800 shown in Fig. 8 
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/ and discussed further below. In operation, data is provided to mainframe computer 120 from 
a data source or sources, such as the source of online transaction data 110. While a single 
source is shown for simplicity, it will be recognized that this source is exemplary only and 
may be representative of a\arge number of sources, such as store data reported for each Wal- 
5 Mart store, for example. To analyze such data, mainframe computer 120 employs database 
software. Large datasets are stored on one or more of the disks 130, 140, or 150 and 
processing is carried out. As addressed in greater detail below, the present invention provides 
techniques for accurately determining when the combined storage limits of the disks 130, 
140, and 150 are approached and generating a report to alert a user. In a presently preferred 

10 embodiment, IMS OS AM collection process \60 automatically emails a user over a 

communication link 162, such as an Internet or mtranet connection, that an exception exists, 
for example, that a threshold has been exceeded, theuiser can print this report on a printer or 
display the report on the display of a personal computet or the like. More importantly, the 
user can take proactive steps to avoid exceeding the system memory capacity. For example, 

15 the user can purge data from the dataset or restructure it, for example, by splitting it. 

On each of the disks 130, 140, and 150 is a volume table of contents (VTOC) that 
contains data about the files on that disk. DCOLLECT, a standard utility to extract 
information about physical files, as addressed further below, runs against all of the VTOC 
and gathers information about the physical files. IMS OSAM is a physical file that can span 

20 multiple volumes or disks. As an example, assume that each of the disks has 2,000 cylinders 
which can be allocated for storing a database or dataset. Assuming a first dataset requires 
3,000 total cylinders of storage is to be stored to disk memory. Assuming the allocation 
begins with disk 130 and then proceeds to disk 140, then disk 130 will be 100% filled with 
data from the dataset. Disk 150 will be 0% filled with data from the dataset. Using tools 
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existing prior to the present invention, disk 140 will correctly show it is 50% full, but disk 
130 will show 0% in many or most cases. 

Similarly, assuming a second dataset requires 5,000 total cylinders of storage. For 
this dataset, it is assumed that allocation begins with disk 150 and then proceeds to disks 130 
5 and 140 in turn. The end result is that disks 130 and 150 are 100% full and disk 140 is 50% 
full. Again, using existing tools, the last disk having data stored, disk 140, correctly shows 
that it is 50% full. However, again in many cases, disks 130 and 150 which are full will be 
shown by the existing tools to be empty. 

In simplified terms, the present invention determines the order in which the disks are 
10 filled, and knowing, for example, that if disk 140 is part full and that it was filled after disk 
130, then adjusting the measure of fullness of disk 130 to 100%. With a correct measure of 
fullness of the three disks 130, 140, and 150, if a threshold is exceeded, an exception report is 
generated. 

Returning to the two examples above with a threshold of 4,750 cylinders, for the first 
15 example, no report will be generated as only 3,000 cylinders are full. However, in the second 
example, using the present invention, it will be correctly determined that 5,000 cylinders have 
been filled and an exception report will be generated. With prior tools, for both examples, it 
is likely that disk 2 140 will be determined to be 50% full with disks 1 and 3 130 and 150, 
respectively, being incorrectly indicated to be empty, thereby, leading a user to inaccurately 
20 believe sufficient disk memory is available for 5,000 additional volumes of data, 1,000 in 
disk 140 and 2,000 each in disks 130 and 150. 

Turning to Figs. 2A-2S, these figures show a process or module 200 for free space 
monitoring and report generation in accordance with the invention. Before discussing 
process 200 in detail, the following additional background regarding how IMS OSAM files 
25 are allocated is provided. 
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IMS OSAM datasets are physical sequential datasets. Guaranteed space is an 
attribute within SMS, and in some cases, this attribute is set on. This attribute changes the 
way that IMS allocates OSAM files, and further aspects of this attribute and allocation will be 
discussed further below. To determine if a dataset is guaranteed space (GTS) or not, an 
5 IDCAMS LISTCAT against the dataset is performed. If the dataset is GTS, according to a 
standard nomenclature of a preferred embodiment utilized by the present assignee, it will 
have a storage class that ends with the letters GTS, for example, STANTGTS. 

DataClass is an SMS function and will give a default number of volumes that a 
dataset can be allocated across. 
10 If the dataset is not guaranteed space, the following rules apply: 



1. 



A primary extent will be taken on the first volume only. 



2. 



Secondary extents will be taken after the primary is taken, even after an 



additional volume is used. 



3. 



If a unit=(3390,n) is coded, where n is some number, that number of candidate 
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volumes will override the SMS data class. For example, if the dataclass is 



vol5 (where a dataset would get 5 volumes) and unit=(3390,2) is coded, only 2 



volumes will be used. But, the number assigned cannot exceed the maximum 



allowed by SMS, so continuing with the vol5 example, if unit=(3390,9) is 



coded, only 5 volumes will be used. 



20 



4. 



A maximum of about 60 extents can be used because of IMS constraints. The 



maximum number of extents on any one volume is 16, but preferably it should 



be kept much lower than that, for example, 7. 



5. 



ISPF 3.4 or ISMF can be used to display space allocations. 



25 



If the dataset is guaranteed space, the following rules apply: 
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1 . A primary allocation will be taken on all volumes. 

a. If the volumes are coded, then a primary allocation will be taken on all 
of them. 

b. If the volumes are not coded, and a unit=(3390,n) is coded, a primary 
5 allocation will be taken in the GTS pool on the number of volumes 

coded. Such an allocation should not be done intentionally. 

c. If the volumes are not coded, and there is no unit coded, then a primary 
allocation will be taken against the number of volumes from the data 
class. Again, such an allocation should not be done intentionally. 

10 d. When converting to GTS from non-GTS, a user should be careful not 

to have the ACS routines changed before the defines are ready. 

2. A secondary allocation will be taken and used on the volume in which the 
initial close occurred following a database load. For example, in the case of a 
first database, Database 1, the close occurs on the last volume as determined by 

15 a listcat. Secondary allocations will only occur on that last volume after all of 

the primary is used on the other volumes. In the case of a second database, 
database2, the close occurs on the second volume. Secondary allocations will 
be taken and used before the last primary is used even though it is already 
allocated. 

20 3. A maximum of 60 extents can be taken. 

4. ISPF and ISMF cannot be reliably used for space determination. The DSCB's 
are not correctly updated by IMS as it was designed at the time the present 
invention was made. 
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Returning to Figs. 2A-2S, process 200 illustrates how IMS allocates OSAM files and 
how IMS OSAM database space calculation and error detection are performed in accordance 
with the present invention. First, the external processes and subroutines are summarized 
below as follows: 

5 

External Processes: 

1. IDCAMS DCOLLECT executes for every volume on the Sysplex. An 
external REXX routine extracts IMS datasets and builds only these files. 

2. This file is sorted in dataset name order. 

10 3. LISTCAT is a standard IDCAMS LISTCAT command. 

4. IEHLIST is a utility that lists all of the data on a DASD device and how much 
free space is left. 



Subroutines: 

1 5 SUBLISTC 300 (Fig. 3) - issues an IDCAMS LISTCAT command for a dataset that 

was passed to it, and puts the results in a dataset. 

SUB009 400 (Figs. 4A-4C) - Reads the output from a SUBLISTC for a dataset and 
returns the following information - gts flag, last volume, total number of volumes and 
20 occurrences of volume serial numbers for a dataset. 

SUB006 500 (Figs. 5A-5cW Reads the output from a SUBLISTC and returns the 
following information - gts flagVnd last volume. 
IEHLISTR 600 (Fig. 6) - Dynamically executes an IEHLIST utility. 
SUBL002 700 (Figs. 7A and 7B) - Reads the output from the IEHLISTR subroutine 
and returns the following information - total free cylinders. 



25 
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After process or module 200 starts after being called in step 201, the process begins 
by reading a first DCOLLECT record in step 202. Next, this first record is parsed for dataset 
name, volume serial or volser, allocated space and used space, in step 204. This information 
initially is determined from the external DCOLLECT process. If the dataset name is 
5 determined to be the same as the previous name in step 206, then process_same_dataset in 
step 208. 

Otherwise, if the dataset name is determined to be not the same as the previous name 
in step 206, it is then determined in step 210 whether this is the first time this dataset has been 
q encountered in this routine. If yes, then a first time process step 212 process_first_time 

yQ 10 proceeds. Otherwise, the dataset is processed as a new dataset, process_new_dataset, in step 
SJ 214. At step 216, process 200 determines if there are more records to be processed. If yes, 

UJ the process proceeds back to step 202 and repeats. When there are no more records to be 

a processed, the process proceeds to step 218 where write_report_ends causes a write report. 

Then, process 200 ends in step 290. 
-B 15 Returning to the PROCESS_FIRST_TIME step 212, first all variables are initialized 

^ in an initialize all variables step 213. Next, total variables are initialized in an initialize total 

variables step 215, and then process 200 proceeds to Process_same_dataset step 208. The 
PROCESS_NEW_DATASET step 214 is followed by a Write_out_dataset step 220. All 
variables are initialized in an iinitialize all variables step 222, which is in turn followed by a 
20 Get_listdsi step 224. Next, a Determirie_total_num_vols operation is performed in step 226, 
which is followed by the Process_same_dataset operation in step 208. 

The PROCESS_SAME_DATASET step 208 adds one to an index and puts the 
variables from step 204 into a current entry list which has a maximum of 20 entries. 

The WRITE_OUT_DATASET_INFO step 226 begins with calling the SUBLISTC 
25 subroutine 300 sublistc using dataset name. Subroutine 300 in turn proceeds to call 
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subroutine 400 sub009. These subroutines are described in further detail below in connection 
with the discussions of Figs. 3 and 4A-4C, respectively. A get step 221 gts_flag and 
all_volsers from sub009 400. The volsers are split in a Split_volsers step 222 and then each 
item is processed serially in step 225. As used here, serial processing means that decision 
5 blocks 227, 229, and 23 1 are all processed serially, regardless of the answer to a preceding 
block. In step 227, it is determined if the variable used is all zeroes, if yes, then space is 
adjusted in an adjust space step 228. If as part of serial processing step 225, it is determined 
in step 229 that if all zeroes and the volsers are out of sync, then ordinates are retrieved in a 
get_ordinates step 230. Alternatively, if an all zeroes determination is made at step 23 1, then 

10 a Total up alloc step 232, followed by a Total_up_used step 234, and a Calc_percent_used 
step 236 result. Following serial processing step 225, a write blank line step 238 occurs. In 
step 240, if gts_flag = "YES", then a display or write message step 241 results. If the 
gts_flag = "NO", it is determined in step 242 if the number of candidates is greater than 90, 
then the display message step 241 again results. If the number of candidates is not greater 

15 than 90 in step 242, the process proceeds to step 244 to calculate the number of volumes left 
based on data class, before proceeding to the write message step 241. 

If it is determined in step 246 that the volumes are out of sync, then display message 
step 247 displays a message to this effect. If it is determined that the volumes are not out of 
sync in step 248, then a message to this effect is displayed in step 249. In step 250, it is 

20 determined whether there is space used. More particularly, if all zeroes is not = zeroes, there 
is space used. If space is used, then Further_process_output step 25 1 proceeds. If not, a 
display message step 252 is followed by a Determine_over_threshold step 254. 

Returning to FURTHER_PROCESS_OUTPUT step 251, a report line is formatted 
for each volser, and then these lines are written. 
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Returning to SPLIT_VOLSERS step 223, all volsers from a subroutine are parsed to 
listcat_volser_tables, and a number_candidates step 233 determines the number of entries. If 
there are more than 20 entries, in other words, if there are more than 20 volumes, there is a 
more serious problem. In step 235, a write message provides an indication of this severe 
5 error as follows. 

ADJUST_SPACE step 228 is followed by a decision step 237. If there are more than 
20 volsers, then a message is played in response to write error step and the exit code is set to 
16 indicative of a severe error. This process looks at the information from the listcat and the 
dcollect to make sure that the order is correct. It sets the used equal to the alloc based on a 
10 switch (found_end_vol). This switch is normally set off, and turned on when a non-zero 
value is found. If the found_end_vol switch is on, then the adjustment is ignored. This is 
because when there is a multi-volume OSAM dataset in IMS, the volumes prior to the initial 
close are set to null, and show zero, even if there is space used. 

Returning to TOTAL_UP_ALLOC step 232, this step adds alloc_space for 
15 each volser. Similarly, TOTAL_UP_USED step 234 adds used_space for each_volser. 

CALC_PERCENT_USED step 236 calculates if the used_space and the alloc_space are not 
equal to zero, then the percent used for volume is calculated. 

With respect to GET_ORDINATES step 230, it is determined if the volser entry = 
listcat volser, by matching listcat to dcollect. Next, the order number is put in the order 
20 variable for the row. 

GET_LISTDSI step 224 gets dataset information for directory, no-recall and 
smsinfo. LISTDSI is a standard REXX routine to determine dataset attributes. 

DETERMINE_TOTAL_NUM_VOLS step 256 strips out the fourth and fifth 
position from the data class and uses them as the number of volumes. This approach is a 
25 standard approach to determining the total number of volumes. 
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WRITE_OUTFILEl step 258 operates to write an error report. 
INITIALIZER ABLE_VARS step 260 sets all initial variables back to base values. 
DETERMINE_OVER_THRESHOLD step 254 determines if percent_used is over 
a threshold in step 255, if there are no more volumes in step 257, and if it is not GTS in step 
5 259. Having made these determinations, the process proceeds to an error routine size step 



Alternatively, if total space is over a threshold, then the process proceeds to an error 
routine space step 265. 

Alternatively, if percent_used is over a threshold in step 267, it is next determined if 



,_□ 10 there are no more volumes in step 269, if it is GTS in step 271, and if there is no more space 



on the last volume in step 273. Having made these determinations, the process 200 proceeds 
to error routine size step 261 . 

In ERROR_ROUTINE_SPACE step 265, a write potential error occurs. For 
ERROR_ROUTINE_SIZE step 261, a write warning error occurs. 



listcat is equal to the number passed from the SUBROUTINE listcat in step 277. Then, the 

lastvolume flag is set equal to that number in step 279. 

For PROCESS_OSAM_GTS step 278, if a space_unit type is not in cylinders 

determination is made in step 280, then in step 28 1 current space is calculated into cylinder 
20 equivalent. A presently preferred approach to this calculation is based on secondary 

allocation. In this approach, it is determined how many cylinders are needed for 1 extent and 

for 2 extents in step 282. This is done to make the math processing a little quicker and to 

save a few lines of calculations. SUBROUTINE sublistc 300 is called in step 283. 

SUBROUTINE sub006 500 is called in step 284. If it is GTS in step 285, then Process_GTS 
25 in step 286. 



261. 



□ 15 



For FIND_LAST_VOLUME step 275, it is determined if the last volume in the 
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PROCESS_GTS step 286 looks up volume information Look_up_vol_info in step 
287. LOOK_UP_VOL_INFO step 287 proceeds to Build_new_sysin_card in step 288, calls 
SUBROUTINE IEHLISTR 600 in step 289, and calls SUBROUTINE SUBL002 700 in step 
291. The above subroutines issue a dynamic IEHLIST that is used to determine how much 
5 free space is on a volume. In step 292, Get total_free_cyls. Based on how much free space is 
left on the volume, one of the two following error routines will be called: 
Write_one_extent_error in step 294 or Write_two_extents_error in step 295. 

BUILD_NEW_SYSIN_CARD step 288 formats a sysin card for an IEHLIST utility 
that will be dynamically executed. WRITE_ONE_EXTENT_ERROR step 294 writes an 

10 output line. WRITE_TWO_EXTENTS_ERROR step 295 writes an output line. 
WRITE_DATASET_NAME step 296 writes an output line. Finally, 
WRITE REPORT ENDS 297 writes output lines for the ending reports. 

Next, we turn to Figs. 3-7 to address in greater detail several presently preferred 
subroutines 300-700 for use in conjunction with process 200. Fig. 3 illustrates an exemplary 

15 subroutine SUBLISTC 300. This subroutine will execute an IDCAMS LISTCAT utility as a 
common subroutine based on information passed through a calling module or process, such 
as process 200. The result information will go to a work fde for further processing by 
another routine. After starting in step 302 upon being called, the process flow proceeds to 
accept a dataset name from the calling module in step 304. An IDCAMS LISTCAT 

20 command is issued in step 306, putting the output to a work file, such as a listcat output in 
step 307. Next, a variable named sub_rc is set to the return code of this module so that the 
calling module can interrogate the results in step 308. Finally, subroutine 300 ends in step 
310. 

Figs. 4A-4C are a flow chart illustrating further details of the presently prefered 
25 subroutine SUB009 400. This subroutine 400 is a common module that will read the output 



-14- 



from the subroutine SUBLISTC 300. After it reads the file, it will pass back to the calling 
module if the volumes are guaranteed space (GTS), how many volumes are used, and what 
these volumes are. The process 400 proceeds as follows once it has started at step 402 after it 
has been called. First, all variables are initialized in step 404. Then, a first record is read in 
step 406. A reformat recs step 408 is done. Next, variables are formatted in a format 
variables step 410. The reformatted variables are returned to the calling module in a return 
variables to calling module step 412. If a record being reformatted in step 408 has dataclass 
or volser indicators, data is accumulated as shown in Figs. 4B and 4C. In step 422, it is 
determined if a line has storage class data and it is determined in step 424 if it is guaranteed 
space or GTS. In step 425, storage class is analyzed to determine does it contain GTS and if 
yes, in step 427, the gts flag is set to "YES". Returning to step 426, it is determined if a line 

has volser data. If yes, in step 428, the line is examined to see if it has " *" in it. If no, in 

step 430, one is added to the number of volumes and in step 432, the line is added to the 
volser. In step 434, the volser is put in the next available bracket. If at step 428 the answer is 
yes, then in step 429, one is added to the number of candidates. 

If there are additional records as determined in step 414, the process returns to step 
408. An additional record is read and the process repeats until it is determined there are no 
additional records in step 414 and process 400 ends in step 416. 

Figs. 5A-5C illustrate an\xemplary subroutine SUB006 500 called by process 200. 
Having been called, subroutine 500 starts in step 502. Variables are initialized in initialize 
variables step 504. A sysprint record is rtoid from the IEHLISTR subroutine 600, which is 
discussed further below, in step 506. The read record is reformatted in step 508 and the new 
variables for the reformatted record are returned ftr the calling module in step 510. 
Subroutine 500 ends in step 512. \ 
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Further details of a presently preferred approach to reformatting records in step 508 
are shown in Fig. 5B. In step 509, it is determined if a line contains a statement "These are 
empty". If yes, if it is determined that the line contains cylinders in step 511, then in step 
513, the free cylinders are established as being the number of free cylinders plus the number 
5 from the current line. If at step 5 1 1, it is determined that the line does not contain cylinders, 
then it is determined in step 515 if the line contains tracks "trks". If yes, the process proceeds 
to step 517 where the number of free tracks is established as the number of free tracks plus 
the number from the current line. 

Fig. 6 illustrates an exemplary IEHLISTR subroutine 600 called by process 200. 

10 Having been called, subroutine 600 starts in step 602. In step 604, subroutine 600 gets 
variables from the calling module. In step 606, these variables are initialized. In step 608, 
the sysin file is allocated. In step 610, sysin data is built from passed variables. Finally, in 
step 612, the IEHLIST utility is executed before subroutine 600 ends in step 614. 

Figs. 7A and 7B illustrate an exemplary subroutine SUBL002 700 called by process 

15 200. Having been called, subroutine 700 starts in step 702. All variables are initialized in 
step 704. In step 706, a record is read. In step 708, it is determined if the record is a storage 
class or storclas line. If yes, it is determined in step 709 if the record is guarantee storage 
"GTS". In making this determination, in step 710, it is determined if there is GTS anywhere 
in the storage class for the record or line. If yes, in step 712, a GTS flag is set to "YES". In 

20 step 714, it is also determined if the record is a volser line. If yes, it is then determined in 

step 716 "Is variable = " *"? If not, then the last volser is established as the current 

variable in step 718. 

Finally, turning to exception reporting, Fig. 8 illustrates an exemplary email report 
800 which is automatically forwarded to a user or users to provide them with a timely 
25 warming that a dataset has exceeded a predetermined threshold. In the exemplary report 800, 
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the dataset name is indicated, for example, name 802. The total kilobytes (kb) allocated, for 
example, allocation data 804 and the total kb used, for example, used data 806, and percent 
used, for example, percent used data 808, are provided. It will be recognized that other 
formats of reports may be readily employed. With a report, such as report 800, the user can 
5 take remedial measures such as purging data from a dataset to reduce its size, splitting a 
dataset or the like before a system failure occurs. 

While the present invention has been disclosed in the context of various aspects of 
presently preferred embodiments, it will be recognized that the invention may be suitably 
adapted to other environments and applications consistent with the claims which follow. 

10 
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